Learning the Logic of Simple Phonotactics
Internal identifier: 001C36 (Main/Exploration); previous: 001C35; next: 001C37
Authors: F. Tjong Kim Sang [Belgium]; John Nerbonne [Netherlands]
Source:
- Lecture Notes in Computer Science [0302-9743]; 2000.
Abstract
We report on experiments which demonstrate that by abductive inference it is possible to learn enough simple phonotactics to distinguish words from non-words for a simplified set of Dutch, the monosyllables. The monosyllables are distinguished in input so that segmentation is not problematic. Frequency information is withheld as is negative data. The methods are all tested using ten-fold cross-validation as well as a fixed number of randomly generated strings. Orthographic and phonetic representations are compared. The work presented in this chapter is part of a larger project comparing different machine learning techniques on linguistic data.
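The evaluation protocol mentioned in the abstract (training on positive data only, ten-fold cross-validation, and a fixed set of randomly generated strings as negative test items) can be illustrated with a minimal sketch. The learner below is a simple character-bigram acceptor used as a stand-in; it is not the abductive method of the chapter. The word list, function names, and parameters are hypothetical and chosen only for illustration.

```python
import random

def kfold_splits(items, k=10, seed=0):
    """Yield (train, test) pairs for k-fold cross-validation."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    folds = [shuffled[i::k] for i in range(k)]
    for i in range(k):
        train = [w for j, fold in enumerate(folds) if j != i for w in fold]
        yield train, folds[i]

def bigrams(word):
    """Character bigrams of a word, with '#' marking both word boundaries."""
    padded = "#" + word + "#"
    return [padded[i:i + 2] for i in range(len(padded) - 1)]

def learn_bigrams(words):
    """Stand-in learner: remember every bigram seen in the positive data."""
    seen = set()
    for w in words:
        seen.update(bigrams(w))
    return seen

def accepts(model, word):
    """A string is accepted only if all of its bigrams were seen in training."""
    return all(b in model for b in bigrams(word))

def evaluate(words, k=10, n_random=100, seed=0):
    """Return one (accepted-words rate, rejected-random-strings rate) pair per fold."""
    rng = random.Random(seed)
    alphabet = sorted(set("".join(words)))
    max_len = max(len(w) for w in words)
    known = set(words)
    # Fixed set of random strings, reused for every fold; known words are excluded.
    negatives = []
    while len(negatives) < n_random:
        s = "".join(rng.choice(alphabet) for _ in range(rng.randint(1, max_len)))
        if s not in known:
            negatives.append(s)
    results = []
    for train, test in kfold_splits(words, k, seed):
        model = learn_bigrams(train)  # no negative data, no frequencies
        accepted = sum(accepts(model, w) for w in test) / len(test)
        rejected = sum(not accepts(model, s) for s in negatives) / len(negatives)
        results.append((accepted, rejected))
    return results

# Hypothetical toy data: a dozen Dutch monosyllables in orthographic form.
TOY_WORDS = ["kat", "bal", "dak", "pen", "vis", "bos",
             "tak", "mes", "pot", "lip", "nek", "rug"]

if __name__ == "__main__":
    for accepted, rejected in evaluate(TOY_WORDS):
        print(f"accepted unseen words: {accepted:.2f}, rejected random strings: {rejected:.2f}")
```

With so few words the per-fold figures are not meaningful; the sketch only shows how held-out words measure acceptance of unseen positive data while random strings probe rejection.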
Url: https://api.istex.fr/document/70C7D08C9A5C9D7C6CC886ADF30144BD359087A2/fulltext/pdf
DOI: 10.1007/3-540-40030-3_7
Affiliations:
- Belgium, Netherlands
- Groningen (province), Antwerp (province)
- Antwerp, Groningen (city)
- University of Antwerp, University of Groningen
Links toward previous steps (curation, corpus...)
- to stream Istex, to step Corpus: 001E31
- to stream Istex, to step Curation: 001D08
- to stream Istex, to step Checkpoint: 001238
- to stream Main, to step Merge: 001D35
- to stream Main, to step Curation: 001C36
The document in XML format
<record><TEI wicri:istexFullTextTei="biblStruct:series"><teiHeader><fileDesc><titleStmt><title xml:lang="en">Learning the Logic of Simple Phonotactics</title>
<author><name sortKey="Tjong Kim Sang, F" sort="Tjong Kim Sang, F" uniqKey="Tjong Kim Sang F" first="F." last="Tjong Kim Sang">F. Tjong Kim Sang</name>
</author>
<author><name sortKey="Nerbonne, John" sort="Nerbonne, John" uniqKey="Nerbonne J" first="John" last="Nerbonne">John Nerbonne</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:70C7D08C9A5C9D7C6CC886ADF30144BD359087A2</idno>
<date when="2000" year="2000">2000</date>
<idno type="doi">10.1007/3-540-40030-3_7</idno>
<idno type="url">https://api.istex.fr/document/70C7D08C9A5C9D7C6CC886ADF30144BD359087A2/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">001E31</idno>
<idno type="wicri:Area/Istex/Curation">001D08</idno>
<idno type="wicri:Area/Istex/Checkpoint">001238</idno>
<idno type="wicri:doubleKey">0302-9743:2000:Tjong Kim Sang F:learning:the:logic</idno>
<idno type="wicri:Area/Main/Merge">001D35</idno>
<idno type="wicri:Area/Main/Curation">001C36</idno>
<idno type="wicri:Area/Main/Exploration">001C36</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">Learning the Logic of Simple Phonotactics</title>
<author><name sortKey="Tjong Kim Sang, F" sort="Tjong Kim Sang, F" uniqKey="Tjong Kim Sang F" first="F." last="Tjong Kim Sang">F. Tjong Kim Sang</name>
<affiliation wicri:level="4"><country xml:lang="fr">Belgique</country>
<wicri:regionArea>CNTS - Language Technology Group, University of Antwerp</wicri:regionArea>
<placeName><settlement type="city">Anvers</settlement>
<region type="district" nuts="2">Province d'Anvers</region>
</placeName>
<orgName type="university">Université d'Anvers</orgName>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Belgique</country>
</affiliation>
</author>
<author><name sortKey="Nerbonne, John" sort="Nerbonne, John" uniqKey="Nerbonne J" first="John" last="Nerbonne">John Nerbonne</name>
<affiliation wicri:level="4"><country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>Alfa-informatica, BCN, University of Groningen</wicri:regionArea>
<placeName><settlement type="city">Groningue (ville)</settlement>
<region>Groningue (province)</region>
</placeName>
<orgName type="university">Université de Groningue</orgName>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Pays-Bas</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="s">Lecture Notes in Computer Science</title>
<imprint><date>2000</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">70C7D08C9A5C9D7C6CC886ADF30144BD359087A2</idno>
<idno type="DOI">10.1007/3-540-40030-3_7</idno>
<idno type="ChapterID">7</idno>
<idno type="ChapterID">Chap7</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Abstract: We report on experiments which demonstrate that by abductive inference it is possible to learn enough simple phonotactics to distinguish words from non-words for a simplified set of Dutch, the monosyllables. The monosyllables are distinguished in input so that segmentation is not problematic. Frequency information is withheld as is negative data. The methods are all tested using ten-fold cross-validation as well as a fixed number of randomly generated strings. Orthographic and phonetic representations are compared. The work presented in this chapter is part of a larger project comparing different machine learning techniques on linguistic data.</div>
</front>
</TEI>
<affiliations><list><country><li>Belgique</li>
<li>Pays-Bas</li>
</country>
<region><li>Groningue (province)</li>
<li>Province d'Anvers</li>
</region>
<settlement><li>Anvers</li>
<li>Groningue (ville)</li>
</settlement>
<orgName><li>Université d'Anvers</li>
<li>Université de Groningue</li>
</orgName>
</list>
<tree><country name="Belgique"><region name="Province d'Anvers"><name sortKey="Tjong Kim Sang, F" sort="Tjong Kim Sang, F" uniqKey="Tjong Kim Sang F" first="F." last="Tjong Kim Sang">F. Tjong Kim Sang</name>
</region>
<name sortKey="Tjong Kim Sang, F" sort="Tjong Kim Sang, F" uniqKey="Tjong Kim Sang F" first="F." last="Tjong Kim Sang">F. Tjong Kim Sang</name>
</country>
<country name="Pays-Bas"><region name="Groningue (province)"><name sortKey="Nerbonne, John" sort="Nerbonne, John" uniqKey="Nerbonne J" first="John" last="Nerbonne">John Nerbonne</name>
</region>
<name sortKey="Nerbonne, John" sort="Nerbonne, John" uniqKey="Nerbonne J" first="John" last="Nerbonne">John Nerbonne</name>
</country>
</tree>
</affiliations>
</record>
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001C36 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001C36 | SxmlIndent | more
To link to this page within the Wicri network
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Exploration |type= RBID |clé= ISTEX:70C7D08C9A5C9D7C6CC886ADF30144BD359087A2 |texte= Learning the Logic of Simple Phonotactics }}
This area was generated with Dilib version V0.6.32.